D-PROV: Extending the PROV Provenance Model with Workflow Structure

Authors

  • Paolo Missier
  • Saumen C. Dey
  • Khalid Belhajjame
  • Víctor Cuevas-Vicenttín
  • Bertram Ludäscher
Abstract

This paper presents an extension to the W3C PROV provenance model aimed at representing process structure. Although modelling process structure is outside the scope of the PROV specification, it is beneficial when capturing and analysing the provenance of data produced by programs or other formally encoded processes. We motivate the need for such an extended model in the context of an ongoing large data preservation project, DataONE, where provenance traces of scientific workflow runs are captured and stored alongside the data products. We introduce new provenance relations for modelling process structure, along with their usage patterns, and present sample queries that demonstrate their benefit.

© 2013 Newcastle University. Printed and published by Newcastle University, Computing Science, Claremont Tower, Claremont Road, Newcastle upon Tyne, NE1 7RU, England.

Bibliographical details: MISSIER, P., DEY, S., BELHAJJAME, K., CUEVAS-VICENTTIN, V., LUDAESCHER, B. D-PROV: extending the PROV provenance model with workflow structure [By] Paolo Missier, Saumen Dey, Khalid Belhajjame, Victor Cuevas-Vicenttin, Bertram Ludaescher. Newcastle upon Tyne: Newcastle University: Computing Science, 2013. (Newcastle University, Computing Science, Technical Report Series, No. CS-TR-1375)

Related resources

SHARP: Harmonizing Cross-workflow Provenance

PROV has been adopted by a number of workflow systems for encoding the traces of workflow executions. Exploiting these provenance traces is hampered by two main impediments. Firstly, workflow systems extend PROV differently to cater for system-specific constructs. The difference between the adopted PROV extensions yields heterogeneity in the generated provenance traces. This heterogeneity dimin...

Workflow Provenance Repository

Scientific workflows and their supporting systems are becoming increasingly popular for compute-intensive and data-intensive scientific experiments. The advantages scientific workflows offer include rapid and easy workflow design, software and data reuse, scalable execution, sharing and collaboration, and other advantages that altogether facilitate “reproducible science”. In this context, prove...

A Software Framework for Data Provenance

Data provenance refers to the historical record of the derivation of the data, allowing the reproduction of experiments, interpretation of results and identification of problems through the analysis of the processes that originated the data. Data provenance contributes to the evaluation of experiments. This paper presents a framework for data provenance using the W3C provenance data model, call...

Trust and Risk Relationship Analysis on a Workflow Basis: a Use Case

Trust and risk are often seen as proportional to each other; as such, high trust may induce low risk and vice versa. However, recent research argues that the relationship between trust and risk is implicit rather than proportional. Considering that trust and risk are implicit, this paper proposes for the first time a novel approach to view trust and risk on a basis of a provenance data model (W3C PROV) applie...

An Online Validator for Provenance: Algorithmic Design, Testing, and API

Provenance is a record that describes the people, institutions, entities, and activities involved in producing, influencing, or delivering a piece of data or a thing. The W3C Provenance Working Group has just published the PROV family of specifications, which include a data model for provenance on the Web. The working group introduces a notion of valid PROV document whose intent is to ensure th...

Publication date: 2013